Competitive analysis of the top-K ranking problem
نویسندگان
چکیده
Motivated by applications in recommender systems, web search, social choice and crowdsourcing, we consider the problem of identifying the set of top K items from noisy pairwise comparisons. In our setting, we are non-actively given r pairwise comparisons between each pair of n items, where each comparison has noise constrained by a very general noise model called the strong stochastic transitivity (SST) model. We analyze the competitive ratio of algorithms for the top-K problem. In particular, we present a linear time algorithm for the top-K problem which has a competitive ratio of Õ( √ n); i.e. to solve any instance of top-K, our algorithm needs at most Õ( √ n) times as many samples needed as the best possible algorithm for that instance (in contrast, all previous known algorithms for the top-K problem have competitive ratios of Ω̃(n) or worse). We further show that this is tight: any algorithm for the top-K problem has competitive ratio at least Ω̃( √ n). Stern School of Business, New York University, email: [email protected] Department of Computer Science, Princeton University, email: [email protected] Department of Computer Science, Princeton University, email: [email protected] Department of Computer Science, Princeton University, email: [email protected] 1
منابع مشابه
Top-k Consistency of Learning to Rank Methods
This paper is concerned with the consistency analysis on listwise ranking methods. Among various ranking methods, the listwise methods have competitive performances on benchmark datasets and are regarded as one of the state-of-the-art approaches. Most listwise ranking methods manage to optimize ranking on the whole list (permutation) of objects, however, in practical applications such as inform...
متن کاملStatistical Consistency of Top-k Ranking
This paper is concerned with the consistency analysis on listwise ranking methods. Among various ranking methods, the listwise methods have competitive performances on benchmark datasets and are regarded as one of the state-of-the-art approaches. Most listwise ranking methods manage to optimize ranking on the whole list (permutation) of objects, however, in practical applications such as inform...
متن کاملTop-k best probability queries and semantics ranking properties on probabilistic databases
There has been much interest in answering top-k queries on probabilistic data in various applications such as market analysis, personalised services, and decision making. In probabilistic relational databases, the most common problem in answering top-k queries (ranking queries) is selecting the top-k result based on scores and top-k probabilities. In this paper, we firstly propose novel answers...
متن کاملEnsemble-based Top-k Recommender System Considering Incomplete Data
Recommender systems have been widely used in e-commerce applications. They are a subclass of information filtering system, used to either predict whether a user will prefer an item (prediction problem) or identify a set of k items that will be user-interest (Top-k recommendation problem). Demanding sufficient ratings to make robust predictions and suggesting qualified recommendations are two si...
متن کاملSensitivity Analysis of SAW Technique: the Impact of Changing the Decision Making Matrix Elements on the Final Ranking of Alternatives
Most of data in a multi attribute decision making (MADM) problem are unstable and changeable, and thus sensitivity analysis can effectively contribute to making proper decisions. Here, we offer a new method for sensitivity analysis of multi-attribute decision making problems so that by changing one element of decision making matrix, we can determine changes in the results of a decision making p...
متن کامل